# Spatiotemporal Feature Extraction
Videomae Base Finetuned Ucf101 Subset
A video understanding model fine-tuned on a subset of UCF101 based on the VideoMAE base model, achieving 95.71% accuracy
Video Processing
Transformers

V
anitavero
14
0
Videomae Base Ssv2 Finetuned Rwf2000
A video understanding model based on the VideoMAE architecture, fine-tuned on the RWF-2000 dataset for violence detection tasks
Video Processing
Transformers

V
lmazzon70
30
0
Videomae Base Ssv2
VideoMAE is a self-supervised video pre-training model based on masked autoencoder, pre-trained for 2400 epochs on the Something-Something-v2 dataset.
Video Processing
Transformers

V
MCG-NJU
454
2
Videomae Large Finetuned Kinetics
VideoMAE is a self-supervised video pre-training model based on masked autoencoder, fine-tuned on the Kinetics-400 dataset for video classification tasks.
Video Processing
Transformers

V
MCG-NJU
4,657
12
Featured Recommended AI Models